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20.  Abstract 


from  the  Pyke-Shorack  (1968)  results.  An  expression  for  the 
centering  constant  is  obtained,  thus  unifying  the  H^jek  (1968) 
and  Hoeffding  (1973)  results.  Asymptotic  degeneracy  of  such 
statistics  is  considered. 
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For  the  important  special  ca  e arising  when  one  takes  for  some 


m < n,  C ,= 
’ nl 

and  F 


= C = - and  C , , 
nm  m nnH-1 


, C = 0;  F . 
nn  ’ nl 


= F 


nm 


= Fnn=  G,  Chernoff-Savage  (1958)  obtained  the  asymptotic 


‘nnH-1 

normality  of  Sn  with  some  restrictions  on  the  growth  rates  of  the  first 
two  derivatives  of  cp.  The  same  results  have  since  been  proved  by 
Govindarajulu-Le  Cam-Raghavachari  (1966)  and  Puri-Sen  (1971)  with  milder 
conditions  on  cp.  A far-reaching  generalization  of  these  results  is  due 
to  Hajek  (1968).  In  a remarkable  paper,  Pyke-Shorack  (1968)  provided  an 

' UCCESIWJW 

alternative  approach  to  the  Chernoff-Savage  theorems  by  considering  the  „Tts 

ore 

weak,  convergence  of  a two  sample  engiirical  process  to  a Gaussian  process ; i(^*v.;sou;«ctrB 
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This  approach  has  had  a very  significant  impact  on  research  practices  in 


the  area  of  nonparametric  statistics  and  there  now  exists  a substantial 
body  of  literature  devoted  to  stochastic  processes  arising  in  such  pro- 
blems. In  spite  of  this,  however,  techniques  of  neither  the  Pyke-Shorack 
kind  nor  of  the  Chernoff-Savage  kind  have  been  applied  to  problems  of  the 
regression  type.  The  method  of  attack  for  such  problem  has  for  the  most 


part  been  along  the  lines  of  Hajek  (1968),  Puri-Sen  (1969),  Hoeffding  (1973), 
to  name  a few.  We  have  been  able  to  obtain  results  of  the  type  given  in 
Hajek  (1968)  both  by  the  Chernoff-Savage  approach  and  by  a method  involving 
convergence  of  certain  stochastic  processes.  The  latter  is  given  in  this 
paper;  the  former,  together  with  some  results  on  convergence  rates  is 
published  separately.  (See  Puri  -Raj  aram  (1977).) 

As  is  well  known,  the  classical  weak  convergence  method^to  be  found, 
for  example,  in  Billingsley  (1968)  or  Pyke-Shorack  (1968)  generally  entail 
securing  certain  bounds  on  the  sample  path  fluctuations  of  the  processes 
involved,  usually  with  the  help  of  certain  refined  metrics.  If  the  distri- 
butions are  allowed  to  vary  a great  deal,  any  analysis  of  the  sample  paths 
can  become  extremely  involved.  Further,  unlike  statistics  of  the 
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Kolmogorov-Smirnov  type  which  axe  measures  of  the  maximum  sample  path 
fluctuation,  statistics  of  the  regression  type  are  weighted  sums 
involving  ranks  and  any  sample  path  behavior  as  such  is  not  directly 
relevant.  Taking  these  factors  into  consideration,  we  have  adopted  an 
approach  which  uses  only  the  second  order  properties  of  the  processes 
involved,  thus  circumventing  the  need  for  sample  path  analysis. 

It  is  well  known  that  a second  order  process,  that  is  to  say  a 
stochastic  process  X(t)  whose  variance  K(t,  t)  is  finite  for  all  values  of 
t,  spans  a reproducing  kernel  Hilbert  space  with  kernel  K(s,  t)  = 

Cov(X(s),  X(t)).  We  exploit  the  fact  that  linear  rank  statistics  can  be 
approximated  by  elements  of  the  Hilbert  space  spanned  by  suitably  defined 
empirical  processes.  In  general,  such  a Hilbert  space  does  not 
characterize  the  process  spanning  it  completely.  However,  the  spanning 
process  if  Gaussian,  is  completely  determined  by  its  Hilbert  space.  We 
establish  the  asymptotic  normality  of  linear  rank  statistics  by  proving 
their  convergence  to  a point  in  the  Hilbert  space  spanned  by  a Gaussian 
process.  Integrals  of  th^  type  considered  here  are  used  in  a wide  range 
of  applications;  Parzen  (1958),  for  example,  has  used  them  in  his  basic 
work  in  time  series  analysis.  But  the  application  given  here  for  a pro- 
blem in  nonparametric  statistics  appears  to  be  new.  Brown  (1970)  has 
some  interesting  results  on  convergence  in  distribution  of  stochastic 
integrals.  The  basic  reference  on  reproducing  kernel  Hilbert  spaces  is 
Aronszajn  (1950).  Yosida  (1972)  also  has  useful  results  on  reproducing 
kernels . 

Our  conditions  on  the  score  generating  functions  are  the  same  as 
those  in  Puri-Sen  (1971) , being  somewhat  stronger  than  those  of  Hajek  (1968) 
but  less  stringent  than  in  Chernoff-Savage  (1958).  However,  the  centering 
constant,  is  used  tliroughout  in  the  place  of  E(S^)  used  by  Hajek.  This, 
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in  addition  to  being  easier  to  com; ute,  serves  to  unify  the  results  of 
Hajek  (1968)  and  Hoeffding  (1973)-  We  also  examine  the  asymptotic  degeneracy 
of  and  in  this  respect  our  results  extend  both  the  Hajek  and  the 
Chernoff- Savage , Puri-Sen  results. 


2.  Definitions  arid  Terminology. 


Let  X ,,  X •••  X be  a sequence  of  independent  random  variables 
nl  nn 


with  continuous  distribution  functions  £’nl>  F^,  **’»  ^nn  respectively. 

Let  R . be  the  rank  of  X . among  (X  ,,*•*.  X ) , that  is 
ni  ni  nl’  nn’  ’ 


R . = (number  of  X . < X . ) : j = 1,  — , n. 
ni  nj  — ni  ’ ’ ’ ’ 


We  define  a simple  linear  rank  statistic  by 

(2.1) 

where,  (C^,  ••*,  Cnri),  (a^l),  •••,  a^(n) ) are  known  constants,  C^_.  being 


S = S C . a (R  .) 
n . ni  n ni 
i=l 


nn' ' ' n'  ' ' ' n • • ' nl 

known  as  regression  constants  and  a (i)  as  scores.  One  usually  assumes 


n 


a^(i)  to  be  generated  by  a known  real-valued  function  c p defined  on  (0,  l) 


in  either  of  the  following  ways: 


(2.2)  (a) 


an(i)=<P(^  W an(i)  = E cp(U^i)) 


,(i) 


where  is  the  ith  order  statistic  among  n independent  random  variables 


distributed  uniformly  over  (0,  1).  We  shall  assume  (2.2)  (a),  namely, 
an(i)  = cp(^-r) . Extension  to  case  (b)  can  be  done  in  one  of  the  several 
ways  but  has  little  to  do  with  the  techniques  developed  in  this  paper  and 
need  not  concern  us.  Thus,  by  (2.2)  (a),  we  can  write  as 

n 


(2.3) 


R . 

S = Sc.  cp(-“). 
n ni  Y'n+1’ 


For  a fixed  x,  we  define  the  empirical  distribution  function 


H (x)  for  the  sample  X X , by 

n'  ’ 1 nl’  ’ nn’  J 


. —i-*  - ■- 


Pyke-Shorack  (1968)  consider  the  two-sarrple  empirical  process  defined  "by 

(2.7)  Ln(t)  =vTn  (FjH'^t))  - F(H_1(t))} 

where  F (x)  is  the  empirical  distribution  function  corresponding  tD  the 
m 

sample  (X^,  ' * * , Xffl) . They  obtain  the  asymptotic  normality  of  Sn 
^responding  to  the  two  sample  cases  (the  Chernoff-Savage  theorem)  by 
proving  the  weak  convergence  of  Ln(t)  to  a Gaussian  process  and  also  its 
convergence  in  some  refined  metrics.  Thus  for  the  general  regression  case, 
that  is,  when  Sn  is  defined  by  (2.1)  or  (2.3),  it  is  natural  to  consider 
the  process  defined  by 

^ _1  1 

(2.8)  L (t)  = £ C . (I(X  . < H (t))  - F .(H  (t))}. 

Unfortunately  this  process  is  much  too  complex  and  classical  weak  conver- 
gence methods  entail  a detailed  study  of  the  sample  path  fluctuations  of 
the  process.  Hence,  we  will  avoid  this  by  basing  our  approach  on  the 
second  order  behavior. 
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3.  Stochastic  Integrals  of  Second  Order  Stochastic  Processes. 

Let  {X(t):  t f T)  be  a real- valued  second  order  Stochastic  process 
with  the  kernel  K(s,  t)  = E [X(s)  X(t)].  Let  {Y(t):  t £ T}  be  another  real- 
valued second  order  stochastic  process  with  the  kernel  R(s,  t)  = E[Y(s)  Y(t)], 
We  always  assume  that  T is  an  interval  on  the  real  line. 

Let  T = [a,  b]  where  a < b are  real  numbers.  We  define  the  second 
order  stochastic  integral  of  X(t)  with  respect  to  Y(t)  as  follows.  We 
follow  essentially  Lo^ve  (1963). 

Let  D„:  a = t,  < t_  < • • • < t , = b. 

T 12  n+1 

Define  the  random  step  function 


(3.1) 


where 


JT  4*>d  Y(t)  . S Xk  (Y(Vl)  - Y(tk)) 

X X. 


Then  the  stochastic  integral  (second  order)  of  X(t)  with  respect 
to  Y(t)  is  defined  by 


(3.2) 


f X(t)  d Y(t)  = l.i.m.  r 3L  (t)  d Y(t) 

* m I 4-  4-  I .r\  rn 


max|t  -th°  T 
l<k<n 


if  it  exists,  where  l.i.m.  means  limit  in  the  quadratic  mean. 

Often  we  will  need  to  define  it  as  an  improper  integral  on  T = (a,b), 
Then  we  take  T'  = [a, (3],  a < a < g < b and  first  define  as  before 


(3.3) 


r x(t)  d Y(t) 

* m 1 


and  then  define  the  improper  integral 


(3.4) 


[ X(t)  d Y(t)  = l.i.m.  [ X(t)  d Y(t) 
J T o^a  J T ' 

g-b 


if  it  exists. 
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Remark  1.  These  integrals  d pend  on  the  "increments"  of  Y(t)  and 
not  directly  on  Y(t)  itself. 

The  following  theorem  ensures  the  existence  of  such  an  integral. 

Theorem  3.1.  Let  the  second  order  stochastic  process  X(t)  with  the 

kernel  K(s,  t)  he  independent  of  the  "increments"  of  the  second  order  process 

Y(t)  with  the  kernel  R(s,  t) . Then  the  stochastic  integral  [ X(t)  d Y(t) 

J T 

exists  in  the  rani;  of  (5-2)  (or  (3.4))  if  and  only  if 

J J K(s,  t)  dl2  R(s,  t) 

exists  as  a Riemann-Stielt jes  (perhaps  improper)  integral. 

Proof.  See  Lodve  (1963). 


Remark  2 . Very  often  in  applications,  Y(t)  will  he  a nonrandom 
function  g(t).  Then  theorem  3.1  reduces  to  the  following 

Corollary  3.2.  The  stochastic  integral  X(t)  d g(t)  exists  if 

•*  T 

and  only  if  ( K(s,  t)  d g(s)  d g(t)  exists  as  a Riemann-Stielt jes 
^ T ^ T 

(perhaps  improper)  integral. 

Proof.  Immediate  from  theorem  3.1. 

The  following  property  of  Gaussian  processes  proves  very  useful. 
Theorem  3.3:  If  Y(t)  is  a nonrandom  function  integration  of  a 

Gaussian  process  with  respect  to  Y(t)  preserves  normality. 

Proof.  See  Lodve  (1963). 


4 . Second  Order  Properties  of  Weighted  Empirical  Processes. 

Let  X X '*•,  X , •••he  independent  random  variables  with 
nl*  n2  ’ nn’  * 

the  corresponding  distribution  functions  F^,  F^,  ••*,  F^,  •••  which 
are  continuous . We  define 


(4.1) 


F = - E F . . 
n n . , ru 

i=l 


Let  C^,  C^n  be  known  constants.  We  shall  investigate  the 


second  order  properties  of  the  two  processes. 


(4.2) 


X (t)  = E {l(X  . < F (t) ) - F ,(F~ 
n v n ± i m - n v ' ' ruv  n 


-1 


(t))) 


1 _i 

= T~  {I(X  . < F X(t)  - t) . 
Vn  i 1 m-  n ' ' 1 


(4.3) 


y (t)  4EC.  (I(X  . < p;l(t))  - F 0 < t < 1, 


-1/ 


n i=l 


ni 


ni  — n 


ni  n 


where  s^  > 0 is  a normalizing  constant  and  l(  ) is  defined  by, 


(4.4) 


1 if  y > X . 

Kxni<y)  = 

0 otherwise. 


The  process  X^(t)  is,  of  course,  the  well  known  empirical  process 


and  has  been  extensively  studied.  We  shall  closely  study  the  process 

Y (t).  The  process  Y (t)  as  well  as  the  process  L (t)  defined  in  (2.8) 
n n n 3 c 

have  been  considered  by  Koul  (1970)  and  Koul  and  Staudte  (1972)  respectively. 
Our  approach,  however,  is  significantly  different. 


Lemma  4.1:  E X (t)  = E Y (t)  = 0 

n n ' 


(4.5)  E[Xn(s)  Xn(t)]  = Kn(s,  t) 


n 


s At--  E F . (F  1 (s)) 
“ n niv  n v " 

i=l 


F . (F-1(t) ) 
ni  n ' ' 


(4.6)  E(Yn(s)  Yn(t)]  = Rn(s,  t) 


= K S C2.(F  . (F-1(s) ) A F . (F-1(t) ) - F . (F-1( s) ) F . (F_1(t))) 
2 m1  ni  n ni  n ni  n v ' ' ni  n v n 

s 1=1 
n 

where  for  any  two  real  numbers  a and  b,  a A b = min  (a,  b). 


f 


Ft  oof . Proof  of  E Xn(t)  = E Y^(t)  = 0 is  obvious.  Further  (4.5)  is 

a special  case  of  (4.6)  and  hence  it  suffices  to  prove  the  latter  alone. 

Without  loss  of  generality  let  s < t.  Let  u = F ^(s)  < v = F ^(t). 

— n — n 

Then  we  have 

‘ i n 

Y (s)Y  (t)  = £ C .C  .{I(X  . < u)  - F ,(u)}{l(X  . < v)  - F .(v))} 

nv  ' nv  ' 2 . . nx  nj  nx  — nx  ' nj  — ' njv 

sn  1»J_X 


= 4y  £ C .{l(X  . < u)  - F ,(u)}{l(x  . < v)  - F .(v)} 

2 . nx  nx  — ni  ni  — nx 

sn  1=1 

+ £ C .C  . (I(X  . < u)  - F . (u))  (I(X  . < v)  - F .(v)}. 

2 . ni  nj  1 ni  - nx  J 1 no  - ' nn  ' ‘ 

Sn 

On  taking  expectations,  the  expectation  of  the  second  sum  vanishes  and 
is  the  first  sum 


E[(I(X  . < u)  - F . (u)} (l(X  . < v)  - F .(v)}] 
1 nx  - ' nx  J nx  — ' nx  ; 


= F . (u)  A F . (v)  - F .(u)  F . (v). 
nx  ' nx  nx  nx  ' 


This  establishes  (4.6). 


Next  we  examine  the  1^- convergence  of  the  processes  Y (t). 

Definition.  A sequence  Y (t)  of  second  order  processes  converges  in 
to  another  (second  order)  process  Y(t)  if  and  only  if 

||Yn(t)  - Y( t ) ||  = (Var  (Yn(t)  - Y(t))}1/2  0 


as  n — > + oo,  for  each  t. 


We  then  write 


Y (t) 
nv  ' 


Y (t)  - Y(t)||  — > 0. 
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We  next  obtain  criteria  for  Lo  convergence  of  the  processes  Y (t) 

^ n 

defined  in  (4.3). 

Theorem  4.2.  The  second  order  processes  Y (t)  with  R (t)  as  their 
-c nv  ' n • 

kernel  ( defined  by  (4.3)  and  (4.6)  respectively)  converge  to  a second  order 
process  Y(t)  if  and  only  if,  for  each  t. 


(4.7)  lim 


n m 


E{  Yj  2 C ,C  . [I(X  . < F-1(t) ) - F ,(F"x(t))] 
ni  mj  m - n v ' ' ni  n 


s s 

n m i=l  j=l 

n-~f® 


X [I(X  . < F-1( t) ) - F . (F-1( t) ) ] 
ny  - m 


= f(t)  exists. 


Further,  if  (4.7)  holds , the  kernel  of  Y(t)  _is  given  by 
(4.8)  E(Y(s)  Y(t) ] = R(s,  t)  = lim  Rn(s,  t) . 

n-rH= 

In  statistical  applications,  we  have  a continuous  sampling  situation; 

that  is  to  say  if  m < n,  we  have,  X = X .,*••,  X = X . Then  the  above 

— nl  ml  nm  mm 

theorem  becomes  a more  precise 

Corollary  4.3.  Let  n > m and  the  sequences  be  such  that, 

XrJ  = » j < m.  Then  the  process  Y^( t)  defined  as  before  converges  to  a 

process  Y(t)  with  kernel  R(s,  t)  = lim  R (s,  t)  if  and  only  if 

ruco  n 


1 m 

(4.9)  lim  — =-  2 C .C  . (F  .(F  (t))  A F .(F  (t)) 

' ' r s m mi  1 in'  ra  ' ' ' ' 


s s . , 
m-H-oo  n m i=l 

n-.foo 


m n 


exists  and  is  finite  for  each  t. 

Theorem  4.2  is  proved  by  a direct  application  of  theorem  A,  page  469 
of  Lodve  (1963).  Therefore,  we  shall  only  prove  corollary  4.3. 


j 

■ ■* 

- 


l 
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Proof  of  Corollary  4.5.  Note  that  by  the  hypothesis, 

1 n i m 

F = F , , • • • F = F , m < n.  Lut  F = - S F . and  F = - S F 

1 * * rvm  nwn  — n r»  m m m 


run  mm 


nn-.ni  m m . , ni 

1 i=l 


could  be  different.  Also,  C . may  not  equal  C . and  s and  s could  also 

ni  mi  > n m 

be  different. 

In  order  to  simplify  the  notation,  we  shall  write  u = F ^"(t)  and 

v = F-1(t).  Then 
m 

(4.10)  Y (t)  Y (t) 
n in 

= — — S C .C  . {I(x  . < u)  - F . (u)) (I(X  . < v)  - F .(v)) 

Vm  l<i<n  ni  **  ni  **  ~ n>> 

l<j<m 


= — — S C .C  . fl(X  . < u)  - F .(u))(l(X  . < v)  - F .(v)) 

s S m rrn  <•  ' ni  — ' ni  ' mi  - ' mi' 


s s . , ni  mi  1 ni 

n m 1=1 


+ — £ C .C  . (I(X  . < u)  - F .(u))(l(X  . < v)  - F .(v)} 

s s , . ni  mj  ni  - ni  J 1 mj  — mj  1 1 

n m i=/o 

where  we  have  used  the  fact  that  X . - X . and  F . = F . for  i < m. 

ni  -fra.  ni  mi  — 

Observe  that; 

(i)  When  i / j,  l(Xm^  < •)  and  l(Xmj  < •)  are  independent.  Hence 

E{ [l(X  . < u)  - F . (u)]  [I(X  . < v)  - F .(v)])  = 0. 
ni  — ni  nj  — mj  J 

(ii)  When  i = j. 


(4.11) 


[I(X  . < u)  - F .(u)]  [I(X  . < v)  - F .(v)] 
m - ni  ni  — ni 


= I(X  . < u)  I(X  . < v)  - F .(v)  1(X  . < u) 
' ni  — ' ni  — ' mv  ' v ni  - ' 

= F .(u)  I(X  . < v)  + F . (u)  F . (v) . 
ni  ni  — ni  ni 


In  (4.11)  for  the  first  term, 


E{ l(X  . < u)  l(X  . < v)}  = F .(u  l\  v)  = F .(u)  A F .(v). 
ni  — ni  — ’’  ni  ni  7 ni'  ' 

Taking  expectations  in  (4.11),  we  get 


- **<*»«»  A VF»1(t»  - 

Thus  by  taking  expectations  of  (4.10)  and  substituting  (4.12)  and  (i)  we 
get  the  expression  in  (4.9).  An  application  of  theorem  4.2  proves  the 
corollary. 

5.  Limit  Distributions  of  the  Processes  X (t)  and  Y (t). 

n ' nv  ' 

Since  X^(t)  is  a special  case  of  Y (t),  it  suffices  to  establish  the 
results  for  the  latter.  We  have  the  following  theorem: 

Theorem  $.1.  For  the  processes  Y (t),  let 

(i)  conditions  of  theorem  4.2  (or  of  corollary  4. 3)  hold;  and  let 

(ii)  ( max  |C  |)s"1=  O(y-) . 
l<i<n  nl  n 

Then  Y (t)  > Y(t)  where  Y(t)  is  Gaussian  with  the  kernel 

R( s,  t)  = lim  R ( s,  t) . 
n-*+®  n 

Proof.  convergence  and  the  convergence  of  the  kernels  are  immediate 

consequences  of  condition  (i).  It  suffices  to  prove  the  asymptotic  normality 

of  the  finite  dimensional  distributions  of  Y (t). 

nv  ' 

Let  t^,  tg,  •••,  t be  any  fixed  elements  in  T.  Consider  the  random 
vector  (Y  (t  ),  Y (t  )). 

We  use  the  Cramer-Wold  criterion. 

Let  a^,  a^  be  arbitrary  constants.  It  suffices  to  show  that 


k 

(5.1)  S a.  Y(t.)  is  asymptotically  normal. 

j=l  3 3 


To  simplify  the  notation,  we  write 


J = 1’ 


Then 


(5.2)  £ Y(tj)  = £ a.  (£  — i tKX^.  < u.)  - Fn,(u,)]} 


0=1 


j=l  J i=l  sn  ni  " J 111  3 


n C 


■ SJi  ( s a [1(1  . < u.)  -*>,)]) 


i=l  n 0=1 

Q . 

Let  a . = ni  . Observe  that 
ni  


ni  - o ni'  o' 


(5.3) 


a . = 0(1)  in  n. 
ni  ' 


Thus  we  can  write, 

(5.4)  S a Y(t  ) - J-  s a ( S a [I(X  < u ) - F (u  )]) 
j_  «J  *v  n 1=1  nl  j=l  ^ ^ ^ 


In  (5.4),  observe  that  by  (5.3) » each  random  variable 


is  bounded. 
Let 


a . £ a.[l(X  . < u.)  - F .(u.)] 

nl  j=i  J nl  ~ 0 ni  J 


(5’5)  CTni  ' “ni  Var  ( ?,  aJ  [l(Xni  £ V ' W11 

0—  J- 

and 

2 n 2 

(5.6)  an-Ea^. 

i=i 


Then  by  the  bounded  Liapunov  theorem,  £ a.  Y(t.)  is  asymptotically 
2 . . 0=1  J J 

normal  provided  a -»  + <*>;  if  not,  it  is  degenerate.  In  any  event,  Y (t)  is 
n n 

asymptotically  normal,  possibly  degenerate.  We  shall  examine  degeneracy 
more  closely  in  section  8. 
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6.  Asymptotic  normality  of  Certain  Stochastic  Integrals  of  the 

Processes  X (t)  and  Y (t). 

n'  n 

In  this  section  we  consider  the  stochastic  integrals  X^  and  given  by 


(6.1) 


(6.2) 


where 


(6.3) 


xn  = y — J xn(t)  cp’(t)  dc(t) 


S v0 
n 


Yn  = F1  Yn(t)  V (-t)  dt 
D 

C(t)  = S C , F . (F"1  (t)) 


ni  ni  n 


and  cp  is  a known  function. 

In  the  next  theorem  we  obtain  bounds  for  the  variances  of  X and  Y . 

n n 

This  in  turn  will  enable  us  to  prove  the  asymptotic  normality  of  Sn  defined 
in  (2.1).  The  variance  bound  for  Y^  is  also  of  independent  interest  because 
it  gives  a condition  on  cp  for  the  process  cp'(t)  Y (t)  to  possess  kernels  of 
the  so-called  Hilbert-Schmidt  type,  uniformly  in  n. 

Theorem  6.1.  Let  Xn(t),  Yn(t)  and  their  corresponding  integrals  X^  and 
defined.  Let  the  following  conditions  hold: 

(a)  max  |-^|  = O(j-) 

l<i<n  n v n 

(b)  there  exists  6 > 0 such  that  for  some  K (generic) 


|cp’(t)|  < K(t(l  - t)}6-5/2,  0 < t < 1. 


2 

(6.4)  Var  Yn  < 0(1)  [J'1  |q>'(t)|  (Rn(t,  t)}1/^]2  < \ 25-46 


1 2 

(6.5)  Var  X < 0(1)  [I  | cp ' (t)  | (K  (t,  t)}  1//2dt]2  < K 25"46. 

J 0 £, 


A 


4 
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In  (6.8)  observe  that, 


0 < F . (F_1(t) ) < 1 
— m n v — 


and 


,-l. 


0 < 1 “ Fni(Fn  (t)  < 1- 


Thus 


Rn(t,  t)  < 0(1)  i S Fni(F^(t))  = 0(l)t 
i=l 

hn(t,  t)  < 0(1)  Js  [i  - Fni(F~1( t) ) ] = o( i) ( l - t). 


(6.9) 

and 

(6.10) 

Also, 

(6.11)  (Rn(t,  t))1/2|cp,(t))dt 


i=l 


,V2 


First  consider 


Jq  (Rn(t,  t)}1/2|cp-(t)|dt  + J (Rn(t,  t))1/2|cp*(t)ldt 


|J  ' (Rn(t,  t)}  1/^* (t)|dt)  < o(i)  Jl/2  t1/2  k{ t( l - t))6'3/2dt 

0 J o 

< K f1  t6-1  (i)6-5/2dt  . £ 2J/2-=6 
Jo  ^ 6 


using  (6.9)  and  assumption  (b). 

Similarly,  using  (6.10)  we  get 

.1 


J1/2  K'*’  1 1/2  k'( t)|it<|25/2-2‘. 


Combining  the  two,  we  have  (6.4).  The  proof  of  (6.5)  is  similar. 
We  are  now  in  a position  to  prove  the  following  theorem. 


Theorem  6.2.  Let  Xfi(t) , Yn(t) , Xn  and  Yr  be  as  defined  earlier.  Let 

the  following  conditions  be  satisfied; 

(a)  lim  E(X  (t)  X (t))  and  lim  E(Y  (t)  Y (t) ) exist  and  also 
in  n m n 


finite  for  almost  all  t. 


m,n-.+<x> 


Id 


(b)  max 


= o{~). 


l<i<n  n 


(c)  |{p'(t)|  < K [t(l  - t )}^  for  some  6 > 0 and  a generic  constant  K. 

Then,  given  e > 0,  there  exist  Gaussian  random  variables  X and  Y and  a 
positive  integer  n such  that,  n > n entails 


(6.32) 


llx„  - xll  < e 


(6.15) 


llY„  - 'll  <e- 


Remark.  Condition  (a)  can  be  replaced  by  lim  K (s,  t)  = K(s,  t) 

n-*+co 

where  Kn(s,  t)  = E(Xn(s)  Xn(t))  and  condition  of  corollary  4.2  for  Yfi(t) 
Then  it  follows  from  Shorack  (1973)  and  our  corollary  4.2  respectively 
that  ^^(t)  and  Yfi(t)  converge  in  I*-,  to  Gaussian  processes  X(t)  and  Y(t). 
We  examine  these  conditions  more  closely  in  section  8. 

Proof  of  Theorem  6.2.  Consider  Y (t).  By  theorem  4.1  (or 


corollary  4.2)  and  theorem  5.1>  Y (t) 
kernel 


(6.14)  R(s,  t)  = lim  R (s,  t) 

n-Hco 


Y(t)  where  Y(t)  is  Gaussian  with 


■ llm  h A 4 s < 


n-rnjo  s i=l 
n 


(6.15)  Y = f Y(t)  cp'(t)  dt. 

0 

rl  i-l 

Var  Y = 9'(s)  m'^)  R(s,  t)ds  dt 

J0  J0 

< (f1  |cp'(t)|  (R(t,  t))1/2dt)2 
0 

= (J1  lim  |cp'(t)|  (Rn(t,  t) ) 1^2dt) 2 


0 n-.+® 
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< {lim  inf  J |cp'(t)|  (Rn(t,  t)}1/2^}2  < + . . 

n-»foo  0 

Observe  that  by  theorem  6.1,  Var  Yfi  is  finite  uniformly  in  n. 
Thus 


II Yn  - YH  = IIJo  (Yn(t)  - Y^))  «p* (t)dt|| 

< J ||Yn(t)  - Y(t)||  |cp’(t)|dt  — =►  0 as  n - 

because  ]|Yn(t)  - Y(t)||  |cp'(t)|  > 0 and  is  dominated  by  the  integrable 

sequence, 

dlYn(t)||  - ||Y(t)j|)  |cp'(t)|  . 

Thus, 

|| Y - Y||  < e for  n sufficiently  large. 

The  proof  of  ||Xn  - X||  < e is  similar.  X can  be  taken  to  be, 

i r1 

X = — — X(t)  cp'(t)  dC(t) 

vn  s 

n 

■where  X(t)  is  Gaussian  with  K(s,  t)  = lim  K (s,  t)  and 


rwt«> 


n 


C(t)  = £ C . F . (F-1( t) ) . 
x ' . - m m.'  n v 11 

i=l 


7 . Asymptotic  Normality  of  Simple  Linear  Bank  Statistics. 

We  are  now  in  a position  to  prove  the  asymptotic  normality  of  the 


statistic,  Sn,  defined  in  (2.1),  namely 


R 


S = £ C . «>(-££)  . 
n ni  ^'n+1' 


We  make  the  following  assumptions: 


(a)  |cp(i)(t)|  < K(t( 1 - t))^  1 ^Z2,  i = 0,  1;  K,  6 being  generic  constants. 


i 
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(b)  Xn(t)  and  Y (t)  satisfy  condition  of  theorem  4.2  (or 
corollary  4.3). 

|c  .1  , 

(c)  max  —Si — m 0(±-) , 

l<i<n  (Var  S ) ' v n 

v n' 

We  then  have  the  following  theorem: 

Theorem  7.1.  Let  the  independent  random  variables  X •••  X with 
— nl»  ’ nn 

continuous  distribution  functions  F , , • * * , F be  such  that  conditions 
nx  nn 

(a),  (b),  and  (c)  above  are  satisfied.  Then  there  exists  a Gaussian  random 
variable  S with  mean  0 and  variance  1 such  that 
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(7.7)  H (t)  = i 2 kx 

1=1 


It  is  easily  verified  that  by  linear  approximation,  Sn  is  expressible 


as 

(7.8) 

S = I 
n 1 

where  u 
Mn 

is  as 

(7.9) 

Bl*  = 

(7.10) 

B2n 

(7.11) 

Dln  = 

(7.12) 

D2n 

(7.13) 

°3n  = 

'0  “ 
-1  n 


(7.!3)  (<p(^  Hn(t))  - cp(t)  - ^(t)  - t)  tp’(t))  dCn(t). 


We  will  show  that, 

|nn|  < + 


^ s ^Bln  + B2n^  converges  in  probability  to  a normal  random  variable, 
n 


(11)  — (Bln  + B2n)  converges  i 
n 

(iii>  Din‘  V8-’’  1 ' X’  2>  5- 


The  following  inequalities  are  immediate 


(7.14) 

|cn(t)|  < 

n max  ] 

l<i<n 

(7.15) 

|c(t)|  < 

n max  | 

l<i<n 

Thus  |un| 

< | n max 
l<i<n 

lC  -1  - r 

1 ni1  n 

< n max 

lC  1 I’1 

l<i<n 

m1  J0 

0 i=l 
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Proof  Of  (ii);  Consider  = J1  ?(t)  d(Cn(t)  - C(t)). 

Integration  by  parts  yields, 

(7.16)  B = 1°^*)  “ c(t))'p(t)  - f1  cp'(t)  {C  (t)  - C(t))dt. 

0 J0  n 

— ‘ sn  (Cn(t)  - C(t))  cp(t)  — > 0 in  probability  as  t — > 0 or  1. 

Because  an  argument  similar  to  that  in  the  proof  of  Theorem  6.1  yields 

(TUT)  Var  (TttHCntt)  - C(t)))  £ 0(1)  ^ (t  2^,  ^ 2{t)J 

n 

< K2  min  [t6(l-t)6_1,  t1_6(l-t)6], 

+ 0 < t < 1 which  proves  the  claim. 

Next,  observe  that  (Cn(t)  - C(t))  = Y (t)  andVn(H  (t)  - t)  = Xjt). 

n n 


Hence , 


B 

— = - Y and 
s n 

n 


B, 


2n 


n 


n 


where  Xn  and  Yn  have  been  defined  in  the  preceding  section. 

By  theorem  6.1,  for  each  e > 0,  there  exist  X and  Y Gaussian  such  that, 

)|Xn  - X|]  < e/2  and  ||Yn  - Y)|  < e/2. 

Thus,  ||  (Xn  - Yn)  - (X  - Y)||  < e. 

It  remains  to  show  that  X - Y is  Gaussian.  We  can  write 

z„(t)  - Xn(t)  - Yn(t). 

By  rearranging  terms  of  Xn(t)  and  Yn(t),  Z (t)  can  be  written  as 

S ^ ~TL)  (l(Xni  - Fn1(t))  - 

i=l  vn  Sn  ni  n ni  n ‘ 
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and  conditions  of  the  bounded  Liapunov  theorem  are  satisfied.  Thus  Z (t) 
is  asymptotically  Gaussian. 

Further,  Xfl  - Yn  is  a linear  mapping  on  Zn(t)  and  since  X - Y is  the 

limit  of  Xn  - Yn,  must  be  Gaussian.  This  establishes  the  asymptotic 

normality  of  X - Y . 

n n 

We  shall  examine  the  behavior  of  X - Y more  closely  in  section  8. 
This  proves  (ii). 

Here  we  have  taken  sn  = Var(Xn  - Yfi).  Later  we  find  an  approximation 

for  s . 
n 

Proof  of  (iii):  We  shall  simplify  the  notation  by  writing  x = F-1(t)  and 


Hn<*>  * ap  z<Xnl  £ *> 

1=1 

C„(x)  - Zp  Cni  £ *> 

1=  1 

F (x)  = -A  F . (x) 
nv  ni^l  niv  ' 


C(x)  = S C F (x) 
i=l  nl  ni 


r2  ■ J*  dcn(li) 


Hence 


"•18>  Ir2!  £ I = -p'fW'cj  £ 5 A IV 


n i=l 


(as  C (x)  assigns  measure  C . when  x = X and  since  H (x)  < 1) 
n ni  m n — 


Where, 


(7.i9)  V.  -Hiv.(Fn(Xn.)). 

Then,  it  suffices  to  show  that 


1 n 

- £ |V  .|  — > 0 in  probability  as  n — > + ». 
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This  follows  from  "Particular  case  1°,"  Lodve  (1963),  p.  241,  if  we  show 
n E| V . |a 

that  £ — < + 00,  uniformly  in  n for  some  0 < a < 1.  Take 

i=l  n 


a = 2/3;  then 
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I Hn(x)  - Fn(x)!  l<P'(Fn(x))l  ^ 


where  6*  = 6 - 6'  and  6’  <6  is  chosen. 
Thus  it  suffices  to  show  that, 


(7.20) 


K C(e,  6')  I'00 


J"  (Fn(x)(l  - Fn(x))}6  d(Cn(x)  - C(x) ) 


in  probability. 

We  use  the  Liapunov  criterion  for  degenerate  convergence,  (p.  275, 
B(i),  Lo£ve  (1963)), 


J-  J"  (Fn(x)(l  - F^x))}6*-1  d Cn(x) 


s >/n  -co 
n 


s C . (F  (X  ,)(1  - F (X  .)))* 

r . , m nv  ni  nv  nz'  1 

V n s 1=  1 


*^n  C , 

(7.21)  V . = — (F  (X  ,)(1  - F (X  ,))}6  . 

v ni  s 1 nv  ni  n ni  1 

n 

Then, 


(7.22)  — — S C . (F  (X  ,)(1  - F (X  ,))}6*-1 

' r . , m n ni  n ni  ‘ 

v n s 0=1 
n 


= - E v .. 

n i=l  “i 


1 

It  remains  to  show  — S (V  . - E V . ) — > 0 in  probability.  This  will  be 


n . , ni  ni 

i=l 


done  if  we  show  that  for  some  a > 0, 


(7.25)  S E!v„i|a+1 


n i=l 

We  choose  an  a > 0 such  that,  (l  + a) (6*  - l)  > -1.  (I.e. , 0 < a < ■ — -) , 
n 

Then  -±—  T,  El  V . 1 1+<3£ 
l+a  , , 1 nil 

n i=l 
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, Vn  max I C .1  1+a  n 

i| — Hii|  , £ E{r  (X  )(i  - F(x  .))}(l4C!)(6^1) 


< — . i I 

- a n I s 
n n 


= 0(-i)  ^ £ J”  {Fn(x)(l  - Fn(x))}(l'KX)(6*'1)  dPn.(x) 


n i=l 


= 0(-i)  J {Fn(x)(l  - Fn(x))}(l4a)(6*_1)  d Fn(x)  0 as  n — > + 


n -oo 


because  the  integral  is  hounded  uniformly  for  all  n.  This  proves  (7.23). 


Again  we  have  used  the  fact 


lCnil/sn-  0(1>- 


l<i<n 


This  establishes  = o (s  ).  Consider, 

2n  p n'  ’ 

V - f MjSl  VX»  - 'P'V1'))  - V*>  - V*»  ]«„(*)• 


We  note  the  following:  vrith 


(7.24)  C 


5n  Vn 


it  suffices  to  prove 
(7.25)  C5n=  op(l//n). 


Observe  that 


(7.26)  | CJn|  < 0(1)  J"  Hn(x))  - ,(^(=0)  - (£  „n(x) 


- Fn(x))  cp'(Fn(x))jdHn(x) 


because  of  the  fact 


l<i<n 


l-4-l  - o(J). 

s v n 


I 
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The  proof  that  the  right  hand  s'de  of  (7.26)  is  o (— i)  can  be  found  in 

?vn 

Puri  and  Sen  (1971),  pp.  401-405.  This  proves  theorem  7.1. 


Variance  Computation:  Since  by  (ii)  Din  = °p(sn)>  i = 1,  2,  3,  we  can  take 

s to  be  the  variance  of  B,  + B0  . 
n In 

In  order  to  simplify  our  notation,  we  write  y = F ^"(t).  Observe  that, 


C(t>  - Cnj  Fni  <Fn  <*»  ' ? Cnd  Vy>  * C(y>’ 

J—  -L  J—  J- 


With  these  transformations,  we  can  write 


(7.27)  + B2n  . i J 2 (I(Xnl  < y)  - Fni(y»  ,,’(?„<*))  dc(y) 


-»i=  1 


oo  n 

| SC.  (I(X  . < j) 
J . ni  1 ni  — 


F^y)}  cp'(Fn(y))  ^(y) 


S A .(X  .) 
. , mm' 

i=l 


where 


A .(x) 
ni  ' 


= ^ J < y)  - Fni(y))  q>' (Fn(y) ) dc(y) 


" Cni  J {I(X  ^ y)  " Fni(y))  'P'(Fn(y))  dFn(y) 


- K ?,  Cnj  f [I(x  5 y)  - Fni(y»  <P'<Fn(y»  ^(y) 

J=-L  -00 


- 5 S cni  f (I(*  < y)  - Bni(y»  <p'(Bn(y»  dl'nJ(y) 

J=  J-  -oo 


Thus, 
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(7.28)  An.<*)  - I S (c  - cn.)  f (i(*  < y)  - Fni(y))  „'(Fn(y))dF  (y>. 

J=-L  “CD  u 

Thus,  we  can  express  + Bgn  as  the  sum  of  independent  random 

variables,  A . (X  . ) . If  we  write, 

’ ni  ni 


we  can  then  taXe 


s . = Var  A . (X  . ) , 
ni  ni  ni  ’ 


2 “ 2 
s = £ s . . 
n . = 1 ni 


8.  Asymptotic  Degeneracy  of  S ; 

From  the  expression  (7-8)  for  S , namely, 

n ^n  In  2n  In  2n  3n’ 

it  is  clear  that  we  need  only  examine  the  limiting  behavior  of  the  terms 

B^n  + &2n‘  ^ith  ^he  help  of  the  expression  (7.4)  for  A (•)  we  can  express 
B^n  + Bg^  as  a sum  of  independent  random  variables  which  would  enable 
standard  tools  from  stability  theory  to  be  used.  But  a glance  at  the 
expression  for  A ^(*)  suffices  to  convince  us  of  the  impracticality  of  the 
situation.  We  shall  adopt  a different  procedure  using  the  variance 
inequalities  proved  in  section  6. 

Clearly,  sn  are  only  normalizing  constants  and  we  can  replace  cni/sn 

by  c . themselves  but  now  with  the  condition, 

J ni  ’ 


(8.1) 


Then, 


max  |C  | = 0(-±). 


l<i<n 


(8.2)  X = f1  X (t)  cp'  (t)  dC(t) 
v n J 0 


(8'3)  Yn-  L *»<*>  f'(t) 


where  now 


(8.4)  <n(t),  E C^WX^SF-W) 


F . (F-1(t))l 
m n v ' ' ‘ 


In  view  of  (8.1),  Yn(t)  is  asymptotically  either  Gaussian  or  degenerate. 


We  next  examine  conditions  for  the  degeneracy  (or  non-degeneracy)  of 


X - Y in  the  limit, 
n n 


Assuming  that  the  distribution  functions  F possess  densities 


F'.  = f we  can  express 
iu.ni  r 


(8.5)  X = f cp'(t)  X (t)  dt 

v n n 


where 


(8.6)  c-(t)  = {J  S ^.(F-^t))}'1  ( S cn.  f^  (F~x(t) )) . 


n . . m n 
i=l 


We  then  have  the  following  proposition. 


Proposition.  Let  condition  (8.1)  be  satisfied.  With  X and  Y defined 


by  (8.2)  and  (8.5)  respectively,  S = X^  - Y is 


degenerate 


whenever 


(8.7)  lim  S (^r1  " cni)2  Fni(Fn1(t))(l  " = 0 


n-*+  oo  i=  1 


ni  n 


almost  everywhere  on  (0,  1) . 


It  is,  of  course,  assumed  that  ( p and  cp'  satisfy  the  same  conditions  as 


before. 


1 


Proof.  It  is  easy  to  verify  that 


= X - Y = r cp'(t)  Tj  (^---  - c )}I(X  . < F-1(t) ) 
n n n J „ v . , n ni  ‘ ni  - n ' ' ‘ 


F .(F"1(t)))dt. 
ni  n J 


Hence 


V“  Sn  " f -P't-VW  .=  1^  - A F^t))  - 

V i—  ■*■ 


- KAK1^))  F .(F-^t))]  ds  dt 


ni  n 


< lj1|<p,(t)|{  S 1^-  - Cn.|2  - F^F-V))))1/2^]2. 


By  theorem  6.1,  (variance  inequalities)  the  last  quantity  above  is 
uniformly  bounded  in  N.  Now  it  is  easily  verified  that  (8.7)  entails 
Var  Sn  — > 0.  This  completes  the  proof. 

Remark:  A consequence  of  condition  (8.7)  is  that  whether  Sn  is  asymptotically 

degenerate  or  not  depends  very  heavily  on  the  underlying  distributions  and 
it  may  not  be  possible  to  secure  nondegeneracy  by  any  choice  of  the 
regression  constants  Cni  and  the  score  generating  function  cp.  The  difficulty 
is  that  the  so-called  Kolmogorov-Smirnov  bounds,  namely 

p(sup  jH  p (l(Xn.  < F~1(t) ) - Fni(F~1(t) ))  | > a) 
t ’ n i—  1 

can  be  very  small,  when  the  underlying  distributions  are  nonidentical.  An 
example  following  theorem  2.11.8,  p.  41  of  Puri-Sen  (1971),  illustrates  this 
point. 
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